Comparing Feature Matching for Visual Object Categorization: MAX vs. Bag-of-Words
نویسندگان
چکیده
In this paper we address the comparison of two feature matching techniques which can be integrated in the HMAX framework. This comparison involves the originally proposed MAX technique and the histogram technique originating from Bag-of-Words literature. We have found that each of these techniques have their own field of operation. The histogram technique clearly outperforms the MAX technique with 5–15% for small dictionaries up to 500–1,000 features. A second investigation concentrates on comparing the often used hard vector quantization technique and a soft matching score technique for the histogram creation. It was found that the difference in performance is not significant and the scores are often within their standard deviations. Aiming at an embedded implementation such as in a surveillance system, computation power and memory (number of dictionary features) are intrinsically limited, so that the histogram technique is favored over the MAX technique.
منابع مشابه
Comparing Feature Matching for Object Categorization in Video Surveillance
In this paper we consider an object categorization system using local HMAX features. Two feature matching techniques are compared: the MAX technique, originally proposed in the HMAX framework, and the histogram technique originating from Bag-of-Words literature. We have found that each of these techniques have their own field of operation. The histogram technique clearly outperforms the MAX tec...
متن کاملContextual-Guided Bag-of-Visual-Words Model for Multi-class Object Categorization
Bag-of-words model (BOW) is inspired by the text classification problem, where a document is represented by an unsorted set of contained words. Analogously, in the object categorization problem, an image is represented by an unsorted set of discrete visual words (BOVW). In these models, relations among visual words are performed after dictionary construction. However, close object regions can h...
متن کاملConcept-Specific Visual Vocabulary Construction for Object Categorization
Recently, the bag-of-words (BOW) based image representation is getting popular in object categorization. However, there is no available visual vocabulary and it has to be learned. As to traditional learning methods, the vocabulary is constructed by exploring only one type of feature or simply concatenating all kinds of visual features into a long vector. Such constructions neglect distinct role...
متن کاملRecognizing in the depth: Selective 3D Spatial Pyramid Matching Kernel for object and scene categorization
This paper proposes a novel approach to recognize object and scene categories in depth images. We introduce a Bag of Words (BoW) representation in 3D, the Selective 3D Spatial Pyramid Matching Kernel (3DSPMK). It starts quantizing 3D local descriptors, computed from point clouds, to build a vocabulary of 3D visual words. This codebook is used to build the 3DSPMK, which starts partitioning a wor...
متن کاملUnderstanding bag-of-words model: a statistical framework
The bag-of-words model is one of the most popular representation methods for object categorization. The key idea is to quantize each extracted key point into one of visual words, and then represent each image by a histogram of the visual words. For this purpose, a clustering algorithm (e.g., K-means), is generally used for generating the visual words. Although a number of studies have shown enc...
متن کامل